Tested local Wan video gen on a Radeon 8060S (Strix Halo, 48GB UMA, Windows). ZLUDA can't run stock PyTorch; AMD's TheRock gfx1151 wheel gives native ROCm. FastWan 1.3B in 4min, Wan 14B I2V in 13.6min — VAE decode and 16GB-RAM Segfaults are the real limits.
Tested FramePack F1 on an RTX 4060 Laptop (8GB VRAM, 32GB RAM): VRAM peaked at 5.75GB, but the 26GB model overflowed RAM into the pagefile and a 5s clip took 56 min. The real bottleneck for local video gen on a laptop is RAM, not VRAM.
WAN 2.2 image-to-video on Windows + RTX 4060 8GB VRAM in ComfyUI. The 5B fp8 model failed three times; the 14B Rapid distilled model with --lowvram offloading produced a 2-second clip in 111 seconds — vs 82 minutes on M1 Max 64GB. Working setup and what to avoid.
Local video generation test on M1 Max 64GB MacBook Pro. FP8 models don't work on Metal — switching to GGUF got Wan 2.2 running at 82 minutes for a 2-second clip. LTX-2 produced NaN or unusable KSampler output under MPS. Specs, failed configs, and the working setup.
Right after Seedance 2.0 launched, a torrent of Hollywood IP infringement flooded social networks. Disney, Netflix, and Paramount sent cease-and-desist letters; the API release was postponed indefinitely, and face-cloning and person reference features were disabled.
As of February 2026, the Seedance 2.0 API is not yet public. This article summarizes the outlook for ComfyUI integration once the API is released and the preparations to make.
ByteDance’s Seedance 2.0 has been released on Dreamina. From the perspective of someone who has been using Wan 2.x and ComfyUI locally, I considered how the "ease" differs between local and cloud-based video generation services.
A summary of ComfyUI's 'The Complete AI Upscaling Handbook' covering the difference between conservative and creative upscaling, model selection by use case, and benchmarks for both image and video.
Published as an official ComfyUI workflow, InfiniteTalk is a lip-sync model specialized in generating mouth animation from audio files. This article covers how it differs from MOVA and Vidu Q3 and what models it requires.
AnimeGamer, developed by Tencent ARC Lab, generates anime-style videos while tracking game-state transitions. It takes a fundamentally different approach from general-purpose video generation models.
MOVA-720p from the OpenMOSS team is an open-source model that generates video and audio in a single pass. This article covers how it differs from closed models like Vidu Q3 and what its architecture looks like.
Robbyant, an Ant Group subsidiary, released LingBot-World, a world model that generates interactive video in real time from a single image. This article covers how it differs from conventional video generators, its technical features, and Apple Silicon support.